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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE -TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AAGCTTCAAT AGAATCTCAA ATCTCGATGA CTGCTTAGTC TTTTTAATAC TGTCTACTTG 60 

ACAGGGGTAG GAACATAATC GGTCAATTTT AAATATGGGG CATATATTGA TATTTTATAA 120 

AATTTGTTAC GTTTTTTGTA TTTTTTCATA AGATGTGTCA TATGTATTAA ATCGTGGTAA 180 

TGAAAAACAG TATCAAACTA TCAGAACTTT GGTAGTTTAA TAAAAAAACG GAGGTATTTT 240 

ATGGAGGAAA ATAATCAAAA TC AATG CAT A CCTTACAATT GTTTAAGTAA TCCTGAAGAA 300 

GTACTTTTGG ATGGAGAACG GATATCAACT GGTAATTACT CAATTGATAT TTCTCTGTCA 3 60 

CTTGTTCAGT TTCTGGTATC TAACTTTGTA CCAGGGGGAG GATTTTTAGT TGGATTAATA 420 

GATTTTGTAT GGGGAATAGT TGGCCCTTCT CAATGGGATG CATTTCTAGT ACAAATTGAA 480 

CAATTAATTA ATGAAAGAAT AGCTGAATTT GCTAGGAATG CTGCTATTGC TAATTTAGAA 540 

GGATTAGGAA ACAATTTCAA TATATATGTG GAAGCATTTA AAGAATGGGA AGAAGATCCT 600 

AATAATCCAG CAACCAGGAC CAGAGTAATT GATCGCTTTC GTATACTTGA TGGGCTACTT 660 

GAAAGGGACA TTCCTTCGTT TCGAATTTCT GGATTTGAAG TACCCCTTTT ATCCGTTTAT 720 

GCTCAAGCGG CCAATCTGCA TCTAGCTATA TTAAGAGATT CTGTAATTTT TGGAGAAAGA 780 

TTGGGATTGA CAACGATAAA TGTCAATGAA AACTATAATA GACTAATTAG GCATATTGAT 840 

GAATATGCTG ATCACTGTGC AAATACGTAT AATCGGGGAT TAAATAATTT ACCGAAATCT 900 

ACGTATCAAG ATTGGATAAC ATATAATCGA TTACGGAGAG ACTTAACATT GACTGTATTA 960 

GATATCGCCG CTTTCTTTCC AAACTATGAC AATAGGAGAT ATCCAATTCA GCCAGTTGGT 1020 

CAACTAACAA GGGAAGTTTA TACGGACCCA TTAATTAATT TTAATCCACA GTTACAGTCT 1080 
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GTAGCTCAAT 


TACCTACTTT 


TAACGTTATG 


GAGAGCAGCG 


CAATTAGAAA 


TCCTCATTTA 


1140 


TTTGATATAT 


TGAATAATCT 


TACAATCTTT 


ACGGATTGGT 


TTAGTGTTGG 


ACG CAATTTT 


1200 


TATTGGGGAG 


GACATCGAGT 


AATATCTAGC 


CTTATAGGAG 


GTGGTAACAT 


AACATCTCCT 


1260 


ATATATGGAA 


GAGAGGCGAA 


CCAGGAGCCT 


CCAAGATCCT 


TTACTTTTAA 


TGGACCGGTA 


1320 


TTTAGGACTT 


TATCAATTCC 


TACTTTACGA 


TTATTACAGC 


AACCTTGCCA 


GCGCCACCAT 


1380 


TTTAATTTAC 


GTGGTGGTGA 


AGGAGTAGAA 


TTTTCTACAC 


CTACAAATAG 


CTTTACGTAT 


1440 


GCAGGAAGAG 


GTACGGTTGA 


TTCTTTAACT 


GAATTACCGC 


CTGAGGATAA 


TAGTGTGCCA 


1500 


CCTCGCGAAG 


GATATAGTCA 


TCGTTTATGT 


CATGCAACTT 


TTGTTCAAAG 


ATCTGGAACA 


1560 


CCTTTTTTAA 


CAACTGGTGT 


AGTATTTTCT 


TGGACGCATC 


GTAGTGCAAC 


TCTTACAAAT 


1620 


ACAATTGATC 


CAGAGAGAAT 


TAATCAAATA 


CCTTTAGTGA 


AAGGATTTAG 


AGTTTGGGGG 


1680 


GGCACCTCTG 


TCATTACAGG 


ACCAGGATTT 


ACAGGAGGGG 


ATATCCTTCG 


AAGAAATACC 


1740 


TTTGGTGATT 


TTGTATCTCT 


ACAAGTCAAT 


ATTAATTCAC 


CAATTACCCA 


AAGATACCGT 


1800 


TTAAGATTTC 


GTTACGCTTC 


CAGTAGGGAT 


GCAGCAGTTA 


TAGTATTAAC 


AGG AG CGGCA 


1860 


TCCACAGGAG 


TGGGAGGCCA 


AGTTAGTGTA 


GATATGCCTC 


TTCAGAAAAC 


TATGGAAATA 


1920 


GGGGAGAACT 


TAACATCTAG 


AACATTTAGA 


TATACCGATT 


TTAGTAATCC 


TTTTTCATTT 


1980 


AGAGCTAATC 


CAGATATAAT 


TGGGATAAGT 


GAACAACCTC 


TATTTGGTGC 


AGGTTCTATT 


2040 


AGTAGCGTTG 


AACTTTATAT 


AGATAAAATT 


GAAATTATTC 


TAGCAGATGC 


AACATTTGAA 


2100 


GCAGAATCTG 


ATTTAGAAAG 


AG C ACAAAAG 


GCGGTGAATG 


CCCTGTTTAC 


TTCTTCCAAT 


2160 


CAAATCGGGT 


TAAAAACCGA 


TGTGACGGAT 


TATCATATTG 


ATCAAGTATC 


CAATTTAGTG 


2220 


GATTGTTTAT 


CAGATGAATT 


TTGTCTGGAT 


GAAAAGCGAG 


AATTGTCCGA 


GAAAGTCAAA 


2280 


CATGCGAAGC 


GACTCAGTGA 


TGAGCGGAAT 


TTACTTCAAG 


ATCCAAACTT 


CAGAGGGATC 


2340 


AATAGACAAC 


CAGACCGTGG 


CTGGAGAGGA 


AGTACAGATA 


TTACCATCCA 


AGGAGGAGAT 


2400 


GACGTATTCA 


AAGAGAATTA 


CGTCACACTA 


CCGGGTACCG 


TTGATGAGTG 


CTATCCAACG 


2460 


TATTTATATC 


AGAAAATAGA 


TGAGTCGAAA 


TTAAAAGCTT 


ATACCCGTTA 


TGAATTAAGA 


2520 


GGGTATATCG 


AAGATAGTCA 


AGACTTAGAA 


ATCTATTTGA 


TCGCGTACAA 


TGCAAAACAC 


2580 


GAAATAGTAA 


ATGTGCCAGG 


CACGGGTTCC 


TTATGGCCGC 


TTTCAGCCCA 


AAGTCCAATC 


2640 


GGAAAGTGTG 


GAGAACCGAA 


TCGATGCGCG 


CCACACCTTG 


AATGGAATCC 


TGATCTAGAT 


2700 


TGTTCCTGCA 


G 










2711 
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(2) INFORMATION FOR SEQ I&NO:2: <$! 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Glu Asn Asn Gin Asn Gin Cys lie Pro Tyr Asn Cys Leu Ser 
1-5 10 15 

Asn Pro Glu Glu Val Leu Leu Asp Gly Glu Arg lie Ser Thr Gly Asn 
20 25 30 

Ser Ser lie Asp lie Ser Leu Ser Leu Val Gin Phe Leu Val Ser Asn 
35 40 45 

Phe Val Pro Gly Gly Gly Phe Leu Val Gly Leu lie Asp Phe Val Trp 
50 55 60 

Gly lie Val Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie Glu 
65 70 75 80 

Gin Leu lie Asn Glu Arg lie Ala Glu Phe Ala Arg Asn Ala Ala lie 



Ala Asn Leu Glu Gly Leu Gly Asn Asn Phe Asn lie Tyr Val Glu Ala 
100 105 110 

Phe Lys Glu Trp Glu Glu Asp Pro Asn Asn Pro Ala Thr Arg Thr Arg 
115 120 125 

Val lie Asp Arg Phe Arg lie Leu Asp Gly Leu Leu Glu Arg Asp lie 
130 135 140 

Pro Ser Phe Arg lie Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr 
145 150 155 160 

Ala Gin Ala Ala Asn Leu His Leu Ala lie Leu Arg Asp Ser Val lie 

165 170 175 

Phe Gly Glu Arg Trp Gly Leu Thr Thr lie Asn Val Asn Glu Asn Tyr 
180 185 190 

Asn Arg Leu lie Arg His lie Asp Glu Tyr Ala Asp His Cys Ala Asn 
195 200 205 

Thr Tyr Asn Arg Gly Leu Asn Asn Leu Pro Lys Ser Thr Tyr Gin Asp 
210 215 220 



90 
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Trp lie Thr Tyr Asn Arg Leu Arg Arg Asp Leu Thr Leu Thr Val Leu 
225 230 235 240 

Asp lie Ala Ala Phe Phe Pro Asn Tyr Asp Asn Arg Arg Tyr Pro lie 

245 250 255 

Gin Pro Val Gly Gin Leu Thr Arg Glu Val Tyr Thr Asp Pro Leu lie 
260 265 270 

Asn Phe Asn Pro Gin Leu Gin Ser Val Ala Gin Leu Pro Thr Phe Asn 
275 280 285 

Val Met Glu Ser Ser Ala lie Arg Asn Pro His Leu Phe Asp lie Leu 
290 295 300 

Asn Asn Leu Thr lie Phe Thr Asp Trp Phe Ser Val Gly Arg Asn Phe 
305 310 315 320 

Tyr Trp Gly Gly His Arg Val lie Ser Ser Leu lie Gly Gly Gly Asn 

325 330 335 

lie Thr Ser Pro lie Tyr Gly Arg Glu Ala Asn Gin Glu Pro Pro Arg 
340 345 350 

Ser Phe Thr Phe Asn Gly Pro Val Phe Arg Thr Leu Ser lie Pro Thr 
355 360 365 

Leu Arg Leu Leu Gin Gin Pro Cys Gin Arg His His Phe Asn Leu Arg 
370 375 380 

Gly Gly Glu Gly Val Glu Phe Ser Thr Pro Thr Asn Ser Phe Thr Tyr 
385 390 395 400 

Arg Gly Arg Gly Thr Val Asp Ser Leu Thr Glu Leu Pro Pro Glu Asp 

405 410 415 

Asn Ser Val Pro Pro Arg Glu Gly Tyr Ser His Arg Leu Cys His Ala 
420 425 430 

Thr Phe Val Gin Arg Ser Gly Thr Pro Phe Leu Thr Thr Gly Val Val 
435 440 445 

Phe Ser Trp Thr His Arg Ser Ala Thr Leu Thr Asn Thr lie Asp Pro 
450 455 460 

Glu Arg lie Asn Gin lie Pro Leu Val Lys Gly Phe Arg Val Trp Gly 
465 470 475 480 

Gly Thr Ser Val lie Thr Gly Pro Gly Phe Thr Gly Gly Asp lie Leu 

485 490 495 

Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gin Val Asn lie Asn 
500 505 510 

Ser Pro lie Thr Gin Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser 
515 520 525 
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Arg Asp Ala Arg Val He Val Leu Thr Gly Ala Ala Ser Thr Gly Val 
530 535 540 

Gly Gly Gin Val Ser Val Asn Met Pro Leu Gin Lys Thr Met Glu He 
545 550 555 560 

Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr Asp Phe Ser Asn 

565 570 575 

Pro Phe Ser Phe Arg Ala Asn Pro Asp He He Gly He Ser Glu Gin 
580 585 590 

Pro Leu Phe Gly Ala Gly Ser He Ser Ser Gly Glu Leu Tyr He Asp 
595 600 605 

Lys He Glu I-le He Leu Ala Asp Ala Thr Phe Glu Ala Glu Ser Asp 
610 615 620 

Leu Glu Arg Ala Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn 
625 630 635 640 

Gin He Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val 

645 650 655 

Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys 
660 665 670 

Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu 
675 680 685 

Arg Asn Leu Leu Gin Asp Pro Asn Phe Arg Gly He Asn Arg Gin Pro 
690 695 700 

Asp Arg Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asp 
705 710 715 720 

Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu 

725 730 735 

Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys 
740 745 750 

Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp 
755 760 765 

Leu Glu He Tyr Leu He Ala Tyr Asn Ala Lys His Glu He Val Asn 
770 775 780 

Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gin Ser Pro He 
785 790 795 800 

Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn 

805 810 815 



Pro Asp Leu Asp Cys Ser Cys 
820 



